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METHOD AND APPARATUS FOR DIGITAL VECTOR QAM MODULATOR 

This application Claims priority under 35 U.S.C. 119 from Provisional 
Application Serial No. 60/513,985 filed October 27 th 2003. 

This invention relates generally to telecommunication systems. The 
5 present invention relates more specifically to a method of synthesizing a direct QAM 
modulated RF signal with high power efficiency for use in telecommunication 
systems. 

RELATED APPLICATIONS 

This application is related to applications filed on the same day by the 
10 same inventors under Attorney Docket 85195-602 ADB entitled APPARATUS FOR 

FRACTIONAL RF SIGNAL SYNTHESIS WITH PHASE MODULATION and Attorney 

Docket 85195-402 ADB entitled APPARATUS FOR FRACTIONAL RF SIGNAL 

SYNTHESIS the disclosures of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
15 A prior art arrangement is shown and described hereinafter and has a 

number of disadvantages which will become apparent from the description 

hereinafter. 

A search has revealed the following US Patent references: 
5,329,259 Stengel, "Efficient Amplitude/Phase Modulation Amplifier" 
20 5,612,651 Chethik, "Modulating Array QAM Transmitter" 

5,659,272 Linguet, "Amplitude Modulation Method and Apparatus 
using Two Phase Modulated Signals" 

5,867,071 Chethik, "High Power Transmitter Employing a high Power 



QAM Modulator" 

6,366,177 McCune, "High-Efficiency Power Modulators" 

5,852,389 Kumar, "Direct QAM Modulator" 
SUMMARY OF THE INVENTION 

5 

According to the invention there is provided an apparatus for directly 
generating a QAM RF signal comprising: 

a high speed reference clock providing in an input signal having a 
. series of pulses at a frequency of the reference clock which is higher than the 
10 desired output frequency; 

two programmable digital delay elements each arranged to receive the 
reference pulses of the input reference clock and to generate therefrom using input 
data a respective one of two digital vectors; 

and a signal combining element for receiving the digital vectors from 
15 the programmable digital delay elements and for generating the QAM RF signal 
therefrom. 

Preferably there are provided amplifiers for amplifying the digital 
vectors non linearly before combining. 

Preferably the programmable digital delay elements comprise high 
20 speed adders/accumulators wherein said adders/accumulators are arranged to 
determine the amount of delay implemented by the delay elements on the reference 
signal. 

Preferably the output frequency is set from an increment value 
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according to the following equation: 
Increment Value = ((f ref / f ou t) -1 ) * 2 n 
where f re f = Reference clock (1 03) frequency 

fout = Output (110) frequency 
5 n = Number of bits in the accumulator math. 

Preferably the duty cycle is set by initializing the difference of the 
initializing values of the two accumulators according to the following equation: 

The reference clock frequency divided by the desired output frequency 
multiplied by 2 An multiplied by (p/100), where p is the percentage duty cycle and n is 
10 the number of bits in the accumulator math. 

Preferably the worst case frequency resolution is determined by the 

equation: 

The reference frequency divided by 2 An , where n is equal to the 
number of bits in the accumulator. 
15 Preferably a non-linear amplifier is used to produce a high RF output 

power, from the sum of two phase modulated vectors. 

Preferably the duty cycle of the output can be varied by changing the 
difference in the start values of the accumulators for the rising and falling edge delay 
control. 

20 Preferably the interpolator is a linear interpolator. 

Preferably the interpolator is a sin x/x interpolator filter. 
Preferably the need for a reconstruction filter is removed by 
interpolation up to the reference clock rate. 
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Preferably phase delay of the programmable delay is calibrated using 
a look up table or Microprocessor. 

Preferably separate delay controls are used for producing the rising 
and falling edges of the output from the same input edge of the reference clock. 
5 Preferably the reference edge of the reference clock is delayed by the 

programmable delay lines. 

Preferably the reference edge may be either the rising or falling edge 
of the reference clock. 

Preferably the carry bits (overflow bits) are used to control a pulse 
10 swallowing circuit to extend the delay to multi cycles of the input reference clock. 

Preferably the clock swallow circuit can ignore/block multiple reference 
clock pulses thus giving the delay line endless delay capability. 

Preferably the clock swallow circuit can be located prior to or following 
the programmable delay line. 
15 Preferably a set reset flipflop is used to combine the separate rising 

and falling edge delays to form any desired duty cycle output. 

Preferably the output duty cycle is not dependent on the input duty 

cycle. 

Preferably increasing the number of bits in the adder math increases 
20 the frequency resolution with negligible degradation in the phase noise performance. 

Preferably the number of bits of math used in the adder can be equal 
to or exceed the number of bits of control in lookup table and /or the programmable 
delay. 
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Preferably the speed can be increased using parallel processing in the 
adders, and/or accumulators. 

Preferably the adders/accumulators can be implemented in a larger 
lookup table wherein all the answers of the pattern are pre-computed and stored. 
5 Preferably an optional arrangement could include plurality of adders, 

accumulators, pulse swallow circuits, lookup tables, and programmable delay lines 
are used. 

Preferably the lookup table has a multiple set of lookup tables to be 
used for temperature compensation of the programmable delay line. 

10 Preferably the implementation is done fully digitally in an ASIC with no 

requirement for a voltage controlled oscillator, loop filter, or Digital to Analog 
converter used in prior art solutions. 

Preferably an optional arrangement could include filtering of the output 
to produce a signal having less harmonics. 

15 Thus the arrangement described herein pertains to a new method and 

apparatus to produce a fully digital QAM modulated frequency agile RF signal. It is 
based on the summation of two fixed amplitude digital vectors each of which is 
synthesized from a high fixed-frequency reference clock. Pulse stretching is used to 
delay each edge of the reference clock to the desired time. Clock edges are 

20 swallowed in conjunction with the delay to reproduce the clock edge that synthesizes 
any desired lower frequency. Phase modulation of the two signal vectors is 
achieved through the control of the delay with the modulating signal. The invention 
results in direct high output power, high frequency resolution, low phase noise , wide 
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frequency setting ability, and fully digital ASIC implementability. It also results in 
superior power efficiency performance. 
BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of a Prior Art IQ QAM modulator. 
5 Figure 2 is a block diagram of a System for QAM RF signal synthesis 

according to the present invention. 

Figure 3 is a Timing diagram for the Sample shown in Table 1 . 

Figure 4 is a graph showing a Sampled Baseband Spectrum in the 
system of Figure 2. 

10 Figure 5 is a graph showing a Linearly Interpolated Baseband 

Spectrum the system of Figure 2. 

Figure 6 is a graph showing a Sampled Baseband Frequency 
Spectrum the system of Figure 2. 

Table 1 shows Sample timing calculations for single Vector of the 
15 present Invention. 

DETAILED DESCRIPTION 

A prior art, architecture 15 for a QAM modulator 17 is shown in Figure 
1. The modulator 17 accepts a digital input 19 which is fed to an encoder 23. The 
encoder 23 divides the incoming signal into a symbol constellation corresponding to 
20 in-phase (I) (x r (nT)) and quadrature (Q) (j x i( n ~0) phase components while also 
performing forward error correction (FEC) required for subsequent decoding in the 
demodulator. The converter's outputs are fed to the two identical finite impulse 
response (FIR) square-root raised Nyquist matched filters 25, 27. These Nyquist 
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filters 25, 27 are a pair of identical interpolating low-pass filters which receive the I 
(x r (nT)) and Q (jXj(nT)) signals from the encoder 23 and generate real and imaginary 
parts of the complex band-limited baseband signal. The Nyquist filters 25, 27 
ameliorate intersymbol interference (ISI) which is a by-product of the amplitude 
5 modulation with constrained bandwidth. After filtering, the in-phase ((y r (nT'))) and 
quadrature (yi(nT')) components are multiplied 29, 31 with the IF centered carrier 
signals 33, 35. The multiplied signals are then summed 37 producing a band limited 
IF QAM output signal (g(nT)). The digital signal is then converted to an analog 
signal using a D/A converter 39. The analogue signal is processed 41 and fed to a 

10 linear power amplifier for amplification and transmission. Due to the limited 
frequency range of the D/A converter 39, the analog signal processing 40 may also 
contain upconversion to convert the IF frequency from the D/A output 39 to an RF 
frequency signal. This method requires a large, very linear power amplifier as the 
modulation must be produced at low power This consequently results in very poor 

15 power efficiency. 

The present device is arranged to synthesize a direct QAM modulated 
signal digitally. This is achieved by summing two digitally produced phase 
modulated vectors which together implement the required phase and amplitude 
modulation for the QAM signal. The amplitude modulation is only generated at the 

20 last step so that all previous functions are handled in the digital domain. Therefore, 
the amplification of each vector can be done by a non linear and very efficient 
amplifier as each vector has only phase modulation and no amplitude modulation. 
Further, each modulated vector is produced with high resolution from a fixed- 



frequency high speed reference clock. Figure 2 presents a block diagram of the 
invention. The high speed reference clock 103 would typically be an external input 
with high frequency absolute accuracy and very low phase noise performance. 
Examples of sources are well known in the art and include high frequency crystal 
5 oscillators, SAW oscillators, and crystal oscillators with harmonic multiplication. 

The device delays an edge of the reference clock by an amount which 
is controlled by the modulation adder 102 and implemented by the programmable 
delay 106. The reference edge could be either the rising or falling edge of the 
reference clock. There are separate circuits for the control of the two edges so that 

10 the rising and falling edge of the output signal 150 can be independently controlled. 
This ensures that even if the duty cycle of the input reference is not 50%, the output 
150 duty cycle can be controlled as both the rising edge and falling edge delay is 
triggered from the same edge of the reference clock 103. The desired output duty 
cycle is typically 50% to maximize the RF power in the fundamental frequency but 

15 any desired duty cycle can be achieved. Duty cycle is controlled by setting the initial 
value 111. The frequency of the RF output is selected by loading the increment 
value 100. The operation is controlled by two equations. The first equation controls 
the RF output frequency and it determines the value to be loaded in the increment 
value register 101. Given that the high speed adder/accumulator 102 is comprised 

20 of 2 An bits, where n is the number of bits in the accumulator math, the increment 
value 101 is given by the following equation: 

Increment Value = ((f ref / f ou t) -1) * 2 n 

where f re f = Reference clock (1 03) frequency 
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fout = Output (110) frequency 

n = Number of bits in the accumulator math. 

Table 1 shows sample calculations for an example where the high 
speed reference clock 103 is 1000 MHz and the desired output RF frequency is 
5 734.313739 MHz. A value of n = 12 with 12 for 12 bit adding operations is used. 
Using these numbers in the frequency setting equation yields an increment value 
101 of 1482. This increment value is added on each clock cycle to the accumulator 
to produce a new accumulator value. 

The second equation controls the duty cycle of the output. As shown 
10 in Figure 2, there are separate blocks to control the rising edge delay (a) and the 
falling edge delay (b). To accomplish a fixed duty cycle, the increment values 101a 
and 101b must be the same and the initial start up values 111a and 111b in the 
accumulator must be set to provide for the desired fixed delay between them. The 
equation for the initializing value 111b assuming the initializing value for 111a to be 
15 zero is as follows: 

Initializing Value (111b assuming 1 1 1 a is 0) = (f re f / fout) * 2 n * (p/1 00) 
where f re f = Reference clock 103 frequency 
fout = Output 110 frequency 
n = Number of bits in the accumulator math 
20 p = Percentage duty cycle 

For the example shown in Table 1, for duty cycle p = 50 %, the 
initializing value 111b is calculated to be 2789. Table 1 illustrates that the 
adder/accumulator 102a starts at 0 and increments 1482 at every rising edge of the 



10 



clock. At the same time adder/accumulator 102b starts at 2789 and increments 
1482 every rising edge of the clock. Any phase modulation required is added in a 
second modulation adder 120. When the modulation adder 120 overflows and 
produces a carry out due to the math addition, an input pulse edge must be ignored 
5 or "swallowed". This corresponds to phase wraparound, i.e. the phase shift has 
reached 360 degrees and must be set to 0 degrees. In the present invention, 2 An is 
calibrated to equal 360 degrees of the reference clock input 103. This calibration is 
performed in the LUT 105 by a simple mapping of input control bits to desired control 
lines. The filling of the LUT 105 to perform this requirement would be well 

10 understood by those skilled in the art. The LUTs 105 can be implemented using a 
read only memory or with a microprocessor. The adder/accumulator overflows due 
to an addition indicates a greater than 360 degree delay requirement. This delay is 
implemented by using the next clock edge rather than delaying from the original 
clock edge. This allows the programmable delay line 106 to act as a delay line with 

15 endless delay capability. For example if the accumulator is using 12 bit math then 
360 degrees is equal to 2 a12 or 4096 . In the example shown in Table 1, the 
accumulator overflows to 4446, which means the overflow bits are set to a value of 1 
and accumulator value goes to 4446 - 4096 = 350. The circuit implements the 
requirement for this value of phase delay in two parts. It activates the pulse swallow 

20 circuit to ignore one clock edge, and sets the programmable delay to 350 which 
completes the rest of the delay requirement. This unique feature of the present 
invention means that any quantity of overflow bits could be handled. If the addition 
of the increment value 101 to the accumulator value 102 causes, for example, two 
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overflow bits, then the pulse swallow circuit 104 would ignore or "swallow" 2 pulses. 
In this way it is possible to synthesis very low frequencies 108 from the high speed 
clock reference 103. The delay required to achieve this is limited to one cycle at the 
high speed reference clock rate. Furthermore, the accuracy of the timing and jitter is 
5 excellent, as the time is always relative to the closest edge of the high speed clock 
reference 103. The output signal phase noise is not controlled by the loop 
bandwidth nor the phase noise characteristics of the voltage controlled oscillators 
applied in traditional methods. Instead, the phase noise performance is directly 
linked to the high speed reference. This reduces both the jitter and phase noise of 

10 the synthesized RF output 108. The delayed edge from the programmable delay 
106a sets the output RF high 108 by enabling a set-reset flipflop 107. When the 
delayed edge from the programmable delay 106b reaches the flipflop, it resets the 
flip flop 107 and causes the RF output 108 to go low. This completes the synthesis 
of the RF output 108 at the preferred 50% duty cycle rate. Figure 6 illustrates time 

15 plots for the example in Table 1. The upper plot is the high speed reference clock 
plotted over 5500 degrees. The lower plot is the RF output 108, plotted over that 
same 5500 degrees of phase shift with respect to the reference clock. The lower 
plot demonstrated the synthesis of a lower frequency from the high speed reference 
clock. 

20 The frequency step size of this invention depends on the frequency 

and the number of bits n in the accumulator math. It is coarser at frequencies closer 
to the reference clock frequency, and finer at lower frequency outputs. The worst 
case step size is the reference frequency divided by 2 An , where n is equal to the 
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number of bits in the accumulator math. In the example of Table 1, the step size is 
1000 MHz divided by 2 A ". This gives a step size of approximately 244 KHz. To 
improve the frequency resolution an increased number of bits in the math can be 
used. For example with 16 bit math, the frequency resolution improves to 
5 approximately 15.2 KHz. Increasing n to 32 bits would result in approximately 0.2Hz 
frequency resolution. It is only necessary to increase the number of bits of 
resolution in the adder/accumulators 102, and not necessarily the LUTs 105 and the 
programmable dividers 106. The remaining least significant bits can be truncated 
before the LUTs 105 with negligible effect on the RF output 108 phase noise quality. 

10 This means that very fine frequency resolution is achieved with negligible 
degradation in the phase noise. It can also be seen that the increment values 101 
can be changed to provide an essentially instantaneous frequency change. 

Phase modulation is added by the addition of a second adder 120. 
This adder is also high speed and runs at full rate. This modulation adder 120 adds 

15 the desired phase offset to the accumulator value 102 to provide a new increment 
value that is sent to the look up tables 105 and the pulse swallow circuit 104. The 
number added could be positive or negative. The average value added is always 
zero over a long period of time. This ensures the overall effect of the modulation 
adder is only a phase modulation and not a change in the center frequency of 

20 operation. Compared to the reference clock frequency, the modulation information 
(122,123) is at a much lower frequency baseband rate. Figure 4 illustrates an 
example of the incoming sampled baseband using 8 samples per symbol. Graph 
200 is the desired phase rate signal control. Graph 201 is the sampled input. If the 
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graph 201 is placed through a reconstruction filter the desired shape 200 will be 
produced. This is illustrated in spectrum plot of figure 6. The energy of the sampled 
waveform 201 is spread over the desired baseband 400 and the clock 404 and 
aliasing components 402 and 403. A low pass filter 401 is used in prior art, after a 
5 DAC to remove the undesired clock 404 and aliasing components 402 and 403. 
However, in the present invention there is no DAC as the phase modulation is 
achieved by directly adding digitally to the increment value. There is no place to put 
an analog low pass filter. This would result in clock and aliasing signal components 
showing up in the RF output 150. To overcome this problem an interpolator 121 is 

10 used to reduce the clock and aliasing signals as well as to shift their frequency so 
that they may be filtered at the RF output 150 using an optional band bass filter 109. 
The preferred embodiment of the interpolator is a linear interpolator. However, it is 
also valid to use other interpolation techniques such as sin x/x interpolation and 
filtering. Sin x/x interpolation is well understood by those knowledgeable in the art. 

15 Linear interpolation is implemented by drawing a straight line between two known 
points. This is simple to implement as the increment value required for each 
reference clock cycle is based on the equation: Input sample frequency 122 divided 
by the clock reference frequency 103 multiplied by the difference of two adjacent 
sampled data point values. An implementation of the interpolator 121 used for 

20 suppressing the clock and aliasing components is shown in Figure 5. The linear 
interpolated curve 301 now has more power in the desired curve 300 than the non 
interpolated curve 201. A full sin x/x interpolater would remove the clock and 
aliasing component as the phase adjust would occur at every reference clock edge. 
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This alleviates the need for any reconstruction filter which is now replaced with a full 
digital solution that can be implemented using an ASIC. 

Another advantage of the present device is that the output signal 
frequency 150 range is very wide. The pulse swallow 104 circuit can block multiple 
5 reference clock pulses extending the programmable delay indefinitely. This is only 
limited by the number of overflow bits and math bits used. The output frequency 
range coverage can thus be from DC up to the high speed reference clock 
frequency. It is desirable to have as high a reference clock frequency as possible. 
A higher reference clock frequency extends the useful frequency range, and 

10 improves the frequency resolution. The upper reference frequency limit of the 
design is mostly limited by the design speeds of the high speed adders/accumulator 
102 and look up tables 105. It is understood in the art that speeds can be increased 
by parallel processing and other design techniques. For example multiple high 
speed adders/accumulator, LUTs or programmable delay lines could be used in 

15 parallel for increasing the speed and hence the output signal frequency capability of 
the invention. 

The two synthesized RF signals 150 and 154 can be phase modulated 
independently. The first vector circuit 140 is phase modulated from the bit control 
. inputs of 123 and 122. The second vector circuit 141 is phase modulated from the 
20 bit control inputs of 145 and 146. These two vector circuits 140 and 141 share the 
same high speed reference clock 103, and frequency load increment value (100). 
The circuits of 140 and 141 are digital circuits with digital input and outputs. If 
required, these digital signals can be amplified with 151 and 153 to increase the 



level of each phase modulated vector. Each vector is still digital and contains no 
amplitude modulation, so amplification can be done with a non linear, very power 
efficient amplifiers (151 and 153), such as a class C amplifier. The output of the 
amplifiers are combined together in a combiner 152 resulting in an output that has 
5 both phase and amplitude modulation. The peak power corresponds to the sum of 
the two vector powers. The output of the combiner 152 may be optionally filtered 

155 to remove harmonics. The result is a phase and amplitude modulated signal 

156 that is controlled through the input phase control of Vector A (123,122) and 
Vector B (145,146). The modulation is valid for any level of QAM. 

10 Within the spirit of the invention it is also possible to implement the 

invention on every 180 degrees of the reference clock using both the rising and the 
falling edges. Another alternative arrangement is to position the clock swallow 
circuit following the programmable delay line. 

Within the spirit of the invention it is also possible to remove the 
15 adder/accumulators (102) and replace the LUT (105) with a larger LUT. A simple 
counter could increment the values in the LUT. The LUT (105) would in this case 
hold the pre-added values, and just cycle through them until the pattern repeats. 

Within the spirit of the invention is it also possible to compromise 
latency for the speed of the device. It does not matter how many clock cycles it 
20 takes to implement an adder or LUT for example, as long as we get valid data out 
every reference clock cycle. 

It is possible to use a selection of different lookup tables (105) or offset 
values to compensate for the temperature effect on the programmable delay lines 
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(106). It is also possible to vary the implementation of the delay lines by altering the 
input clock signal. Examples of clock alteration would include frequency 
multiplication, division, or phase shifting. 

Since various modifications can be made in my invention as herein 
5 above described, and many apparently widely different embodiments of same made 
within the spirit and scope of the claims without department from such spirit and 
scope, it is intended that all matter contained in the accompanying specification shall 
be interpreted as illustrative only and not in a limiting sense. 



